Overview

Dataset info

Number of variables: 45
Number of observations: 532428
Missing cells: 2380715 (9.9%)
Duplicate rows: 0 (0.0%)
Total size in memory: 182.8 MiB
Average record size in memory: 360.0 B
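
The two derived figures in this block (the missing-cell percentage and the average record size) can be reproduced from the other four. A minimal sketch in plain Python, assuming the percentage is taken over all cells (variables × observations) and that "MiB" means 2**20 bytes; the variable names are illustrative only:

```python
# Figures copied from the "Dataset info" block.
n_variables = 45
n_observations = 532428
missing_cells = 2380715
total_size_mib = 182.8

# Share of missing cells over the full variables x observations grid.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, assuming 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```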

Variable types

Numeric: 24
Categorical: 17
Boolean: 2
Date: 0
URL: 0
Text (Unique): 0
Rejected: 2
Unsupported: 0

Warnings

acc_now_delinq is highly skewed (γ1 = 27.64041058) Skewed
acc_now_delinq has 529949 (99.5%) zeros Zeros
addr_state has a high cardinality: 51 distinct values Warning
annual_inc is highly skewed (γ1 = 44.77621587) Skewed
batch_enrolled has a high cardinality: 105 distinct values Warning
batch_enrolled has 85149 (16.0%) missing values Missing
collection_recovery_fee is highly skewed (γ1 = 30.83826071) Skewed
collection_recovery_fee has 518423 (97.4%) zeros Zeros
collections_12_mths_ex_med has 525346 (98.7%) zeros Zeros
delinq_2yrs has 430104 (80.8%) zeros Zeros
desc has a high cardinality: 70639 distinct values Warning
desc has 456829 (85.8%) missing values Missing
emp_length has 26891 (5.1%) missing values Missing
emp_title has a high cardinality: 190125 distinct values Warning
emp_title has 30833 (5.8%) missing values Missing
funded_amnt_inv is highly correlated with funded_amnt (ρ = 0.9980407483) Rejected
inq_last_6mths has 298854 (56.1%) zeros Zeros
last_week_pay has a high cardinality: 98 distinct values Warning
loan_amnt is highly correlated with funded_amnt_inv (ρ = 0.9971235007) Rejected
mths_since_last_delinq has 272554 (51.2%) missing values Missing
mths_since_last_major_derog has 399448 (75.0%) missing values Missing
mths_since_last_record has 450305 (84.6%) missing values Missing
pub_rec has 451040 (84.7%) zeros Zeros
recoveries has 517723 (97.2%) zeros Zeros
title has a high cardinality: 39694 distinct values Warning
tot_coll_amt is highly skewed (γ1 = 61.64220691) Skewed
tot_coll_amt has 420903 (79.1%) zeros Zeros
tot_coll_amt has 42004 (7.9%) missing values Missing
tot_cur_bal has 42004 (7.9%) missing values Missing
total_rec_int has 10953 (2.1%) zeros Zeros
total_rec_late_fee has 524986 (98.6%) zeros Zeros
total_rev_hi_lim is highly skewed (γ1 = 77.3834345) Skewed
total_rev_hi_lim has 42004 (7.9%) missing values Missing
verification_status_joint has 532123 (99.9%) missing values Missing
zip_code has a high cardinality: 917 distinct values Warning

Variables

acc_now_delinq
Numeric

Distinct count: 9
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 16
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.005014913263
Minimum: 0
Maximum: 14
Zeros (%): 99.5%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 14
Range: 14
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.07911680643
Coef of variation: 15.77630605
Kurtosis: 2290.336321
Mean: 0.005014913263
MAD: 0.009983427378
Skewness: 27.64041058
Sum: 2670
Variance: 0.00625946906
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%) 
0 529949 99.5%
 
1 2304 0.4%
 
2 134 < 0.1%
 
3 16 < 0.1%
 
4 5 < 0.1%
 
5 2 < 0.1%
 
6 1 < 0.1%
 
14 1 < 0.1%
 
(Missing) 16 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 529949 99.5%
 
1 2304 0.4%
 
2 134 < 0.1%
 
3 16 < 0.1%
 
4 5 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
14 1 < 0.1%
 
6 1 < 0.1%
 
5 2 < 0.1%
 
4 5 < 0.1%
 
3 16 < 0.1%
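
Because acc_now_delinq takes only eight distinct non-missing values, its summary statistics can be recomputed from the frequency table alone. The sketch below does that in plain Python. It rests on three assumptions that the report does not state: the variance uses the sample (n - 1) denominator, "MAD" is the mean absolute deviation around the mean, and the skewness is the plain moment-based estimate without small-sample correction.

```python
import math

# Value -> count, copied from the acc_now_delinq frequency table
# (the 16 missing rows are excluded).
counts = {0: 529949, 1: 2304, 2: 134, 3: 16, 4: 5, 5: 2, 6: 1, 14: 1}

n = sum(counts.values())                        # non-missing observations
total = sum(v * c for v, c in counts.items())   # "Sum"
mean = total / n                                # "Mean"

# Sample variance (n - 1 denominator); an assumed convention.
ss = sum(c * (v - mean) ** 2 for v, c in counts.items())
variance = ss / (n - 1)
std = math.sqrt(variance)

# "MAD" read here as the mean absolute deviation around the mean.
mad = sum(c * abs(v - mean) for v, c in counts.items()) / n

# Moment-based skewness, no small-sample correction.
m2 = ss / n
m3 = sum(c * (v - mean) ** 3 for v, c in counts.items()) / n
skewness = m3 / m2 ** 1.5

print(n, total, mean, variance, std, mad, skewness)
```

With more than 99% of the rows at zero, the mean sits close to zero and the few large values (up to 14) dominate the higher moments, which is why the skewness and kurtosis are so large.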
 

addr_state
Categorical

Distinct count: 51
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: CA (77911), NY (44406), TX (42527); other values (48): 367584
ValueCountFrequency (%) 
CA 77911 14.6%
 
NY 44406 8.3%
 
TX 42527 8.0%
 
FL 36575 6.9%
 
IL 21205 4.0%
 
NJ 20103 3.8%
 
PA 18882 3.5%
 
OH 17778 3.3%
 
GA 17292 3.2%
 
VA 15826 3.0%
 
Other values (41) 219923 41.3%
 
Max length: 2
Mean length: 2
Min length: 2
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

annual_inc
Numeric

Distinct count: 33989
Unique (%): 6.4%
Missing (%): < 0.1%
Missing (n): 3
Infinite (%): 0.0%
Infinite (n): 0
Mean: 75029.84329
Minimum: 1200
Maximum: 9500000
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 1200
5-th percentile: 28000
Q1: 45000
Median: 65000
Q3: 90000
95-th percentile: 150000
Maximum: 9500000
Range: 9498800
Interquartile range: 45000

Descriptive statistics

Standard deviation: 65199.84501
Coef of variation: 0.8689854884
Kurtosis: 4815.473459
Mean: 75029.84329
MAD: 31746.56362
Skewness: 44.77621587
Sum: 3.994776431e+10
Variance: 4251019790
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
60000 20558 3.9%
 
50000 18363 3.4%
 
65000 15420 2.9%
 
70000 14493 2.7%
 
40000 14352 2.7%
 
80000 13621 2.6%
 
45000 13589 2.6%
 
75000 13372 2.5%
 
55000 12416 2.3%
 
90000 10305 1.9%
 
Other values (33978) 385936 72.5%
 

Minimum 5 values

ValueCountFrequency (%) 
1200 1 < 0.1%
 
1896 1 < 0.1%
 
2000 1 < 0.1%
 
3000 2 < 0.1%
 
3300 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
9500000 1 < 0.1%
 
8900060 1 < 0.1%
 
8706582 1 < 0.1%
 
8500021 1 < 0.1%
 
8253000 1 < 0.1%
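
Several of the annual_inc figures are derived from others in the same summary: the range from the extremes, the interquartile range from the quartiles, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A short consistency check, using only the rounded values printed in the report (so small rounding differences are expected):

```python
# Figures copied from the annual_inc summary.
minimum, maximum = 1200, 9500000
q1, q3 = 45000, 90000
mean, std = 75029.84329, 65199.84501

value_range = maximum - minimum   # "Range"
iqr = q3 - q1                     # "Interquartile range"
cv = std / mean                   # "Coef of variation"
variance = std ** 2               # "Variance"

print(value_range, iqr, cv, variance)
```

The maximum (9500000) lies more than 100 interquartile ranges above Q3, which is consistent with the skewness warning for this variable.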
 

application_type
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: INDIVIDUAL (532123), JOINT (305)
ValueCountFrequency (%) 
INDIVIDUAL 532123 99.9%
 
JOINT 305 0.1%
 
Max length: 10
Mean length: 9.997135763
Min length: 5
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

batch_enrolled
Categorical

Distinct count: 105
Unique (%): < 0.1%
Missing (%): 16.0%
Missing (n): 85149
Most frequent: (blank value) (106079), BAT2252229 (18791), BAT3873588 (17839); other values (101): 304570; missing: 85149
ValueCountFrequency (%) 
106079 19.9%
 
BAT2252229 18791 3.5%
 
BAT3873588 17839 3.4%
 
BAT2803411 17111 3.2%
 
BAT2078974 14859 2.8%
 
BAT1586599 14463 2.7%
 
BAT1780517 13918 2.6%
 
BAT1104812 13505 2.5%
 
BAT4694572 13504 2.5%
 
BAT1184694 12251 2.3%
 
Other values (94) 204959 38.5%
 
(Missing) 85149 16.0%
 
Max length: 10
Mean length: 7.069098545
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

collection_recovery_fee
Numeric

Distinct count: 12617
Unique (%): 2.4%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 4.859220747
Minimum: 0
Maximum: 7002.19
Zeros (%): 97.4%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 7002.19
Range: 7002.19
Interquartile range: 0

Descriptive statistics

Standard deviation: 63.1233612
Coef of variation: 12.99042881
Kurtosis: 1913.975438
Mean: 4.859220747
MAD: 9.482567814
Skewness: 30.83826071
Sum: 2587185.184
Variance: 3984.558729
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 1.80000000e-02 3.37650000e-01 1.15970000e+00 1.16010000e+00 ... 1.09840635e+03 1.20522600e+03 1.62921850e+03 2.61221840e+03 7.00219000e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 518423 97.4%
 
2 13 < 0.1%
 
2.73 10 < 0.1%
 
3.09 9 < 0.1%
 
1.8 9 < 0.1%
 
3.71 9 < 0.1%
 
1.55 8 < 0.1%
 
2.52 8 < 0.1%
 
2.61 7 < 0.1%
 
1.45 7 < 0.1%
 
Other values (12607) 13925 2.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 518423 97.4%
 
0.036 1 < 0.1%
 
0.0449999999 1 < 0.1%
 
0.063 1 < 0.1%
 
0.0710999972 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
7002.19 1 < 0.1%
 
6972.59 1 < 0.1%
 
6543.04 1 < 0.1%
 
5774.8 1 < 0.1%
 
5694.0936 1 < 0.1%
 

collections_12_mths_ex_med
Numeric

Distinct count: 10
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 95
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.01429932016
Minimum: 0
Maximum: 16
Zeros (%): 98.7%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 16
Range: 16
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.1330051752
Coef of variation: 9.301503408
Kurtosis: 778.8263299
Mean: 0.01429932016
MAD: 0.02822327622
Skewness: 15.76053149
Sum: 7612
Variance: 0.01769037664
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 525346 98.7%
 
1 6494 1.2%
 
2 416 0.1%
 
3 53 < 0.1%
 
4 15 < 0.1%
 
5 6 < 0.1%
 
7 1 < 0.1%
 
16 1 < 0.1%
 
14 1 < 0.1%
 
(Missing) 95 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 525346 98.7%
 
1 6494 1.2%
 
2 416 0.1%
 
3 53 < 0.1%
 
4 15 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
16 1 < 0.1%
 
14 1 < 0.1%
 
7 1 < 0.1%
 
5 6 < 0.1%
 
4 15 < 0.1%
 

delinq_2yrs
Numeric

Distinct count: 27
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 16
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.3144482093
Minimum: 0
Maximum: 30
Zeros (%): 80.8%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 2
Maximum: 30
Range: 30
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.860044949
Coef of variation: 2.735092532
Kurtosis: 52.75034207
Mean: 0.3144482093
MAD: 0.5080480252
Skewness: 5.373310276
Sum: 167416
Variance: 0.7396773143
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%) 
0 430104 80.8%
 
1 67947 12.8%
 
2 20167 3.8%
 
3 7269 1.4%
 
4 3159 0.6%
 
5 1622 0.3%
 
6 898 0.2%
 
7 451 0.1%
 
8 266 < 0.1%
 
9 166 < 0.1%
 
Other values (16) 363 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 430104 80.8%
 
1 67947 12.8%
 
2 20167 3.8%
 
3 7269 1.4%
 
4 3159 0.6%
 

Maximum 5 values

ValueCountFrequency (%) 
30 1 < 0.1%
 
27 1 < 0.1%
 
26 1 < 0.1%
 
24 1 < 0.1%
 
22 1 < 0.1%
 

desc
Categorical

Distinct count: 70639
Unique (%): 13.3%
Missing (%): 85.8%
Missing (n): 456829
Most frequent: "> Debt consolidation<br>" (576), "> Debt Consolidation<br>" (372), "> debt consolidation<br>" (347); other values (70635): 74304; missing: 456829
ValueCountFrequency (%) 
> Debt consolidation<br> 576 0.1%
 
> Debt Consolidation<br> 372 0.1%
 
> debt consolidation<br> 347 0.1%
 
> Debt consolidation.<br> 131 < 0.1%
 
> Pay off credit cards<br> 122 < 0.1%
 
> Credit card consolidation<br> 103 < 0.1%
 
> pay off credit cards<br> 101 < 0.1%
 
> credit card consolidation<br> 68 < 0.1%
 
> Consolidation<br> 65 < 0.1%
 
> To pay off credit cards<br> 52 < 0.1%
 
Other values (70628) 73662 13.8%
 
(Missing) 456829 85.8%
 
Max length: 3966
Mean length: 32.08913318
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True
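
Part of the high cardinality reported for desc comes from variants that differ only in letter case, a trailing period, or leftover markup such as the "> " prefix and the "<br>" tag. The sketch below merges the ten most frequent values shown in the frequency table under one possible normalization. The `normalize` helper is hypothetical and is not something the report applies; the input strings are taken exactly as displayed, which may differ slightly from the raw data.

```python
import re
from collections import Counter

# Top "desc" values and counts as displayed in the frequency table.
top_values = {
    "> Debt consolidation<br>": 576,
    "> Debt Consolidation<br>": 372,
    "> debt consolidation<br>": 347,
    "> Debt consolidation.<br>": 131,
    "> Pay off credit cards<br>": 122,
    "> Credit card consolidation<br>": 103,
    "> pay off credit cards<br>": 101,
    "> credit card consolidation<br>": 68,
    "> Consolidation<br>": 65,
    "> To pay off credit cards<br>": 52,
}


def normalize(text: str) -> str:
    """Hypothetical cleanup: drop '<br>' tags and the '>' prefix,
    trim whitespace and a trailing period, then lowercase."""
    text = re.sub(r"<br\s*/?>", " ", text, flags=re.IGNORECASE)
    text = text.lstrip("> ").strip().rstrip(".")
    return text.lower()


# Sum the counts of all raw values that normalize to the same string.
merged = Counter()
for raw, count in top_values.items():
    merged[normalize(raw)] += count

for value, count in merged.most_common():
    print(f"{value}: {count}")
```

Under this normalization the ten displayed values collapse to five; how far the full 70639 distinct values would shrink cannot be told from the report alone.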

dti
Numeric

Distinct count: 4058
Unique (%): 0.8%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 18.13876663
Minimum: 0
Maximum: 672.52
Zeros (%): < 0.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 5.2
Q1: 11.93
Median: 17.65
Q3: 23.95
95-th percentile: 32.74
Maximum: 672.52
Range: 672.52
Interquartile range: 12.02

Descriptive statistics

Standard deviation: 8.369074218
Coef of variation: 0.4613915813
Kurtosis: 76.36353877
Mean: 18.13876663
MAD: 6.811434912
Skewness: 1.28974002
Sum: 9657587.24
Variance: 70.04140327
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-03 1.0500e-01 2.4500e-01 4.9500e-01 ... 3.9985e+01 4.0020e+01 4.7460e+01 7.2060e+01 6.7252e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
14.4 410 0.1%
 
19.2 399 0.1%
 
18 390 0.1%
 
16.8 385 0.1%
 
15.6 380 0.1%
 
13.2 374 0.1%
 
12 355 0.1%
 
20.4 353 0.1%
 
21.6 348 0.1%
 
22.8 324 0.1%
 
Other values (4048) 528710 99.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 265 < 0.1%
 
0.01 5 < 0.1%
 
0.02 7 < 0.1%
 
0.03 4 < 0.1%
 
0.04 4 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
672.52 1 < 0.1%
 
380.53 1 < 0.1%
 
137.4 1 < 0.1%
 
120.66 1 < 0.1%
 
104 1 < 0.1%
 

emp_length
Categorical

Distinct count: 12
Unique (%): < 0.1%
Missing (%): 5.1%
Missing (n): 26891
Most frequent: 10+ years (175105), 2 years (47276), < 1 year (42253); other values (8): 240903
ValueCountFrequency (%) 
10+ years 175105 32.9%
 
2 years 47276 8.9%
 
< 1 year 42253 7.9%
 
3 years 42175 7.9%
 
1 year 34202 6.4%
 
5 years 33393 6.3%
 
4 years 31581 5.9%
 
7 years 26680 5.0%
 
8 years 26443 5.0%
 
6 years 25741 4.8%
 
(Missing) 26891 5.1%
 
Max length: 9
Mean length: 7.470856153
Min length: 3
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

emp_title
Categorical

Distinct count: 190125
Unique (%): 35.7%
Missing (%): 5.8%
Missing (n): 30833
Most frequent: Teacher (8280), Manager (6922), Registered Nurse (3387); other values (190121): 483006; missing: 30833
ValueCountFrequency (%) 
Teacher 8280 1.6%
 
Manager 6922 1.3%
 
Registered Nurse 3387 0.6%
 
Owner 3305 0.6%
 
RN 3255 0.6%
 
Supervisor 3215 0.6%
 
Sales 2668 0.5%
 
Project Manager 2473 0.5%
 
Office Manager 2189 0.4%
 
Driver 2187 0.4%
 
Other values (190114) 463714 87.1%
 
(Missing) 30833 5.8%
 
Max length: 75
Mean length: 15.33395877
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

funded_amnt
Numeric

Distinct count: 1370
Unique (%): 0.3%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 14744.27129
Minimum: 500
Maximum: 35000
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 500
5-th percentile: 3600
Q1: 8000
Median: 13000
Q3: 20000
95-th percentile: 32000
Maximum: 35000
Range: 34500
Interquartile range: 12000

Descriptive statistics

Standard deviation: 8429.139277
Coef of variation: 0.5716891063
Kurtosis: -0.2537829808
Mean: 14744.27129
MAD: 6865.132918
Skewness: 0.6833169317
Sum: 7850262875
Variance: 71050388.96
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 500. 525. 962.5 1012.5 1087.5 ... 34812.5 34862.5 34887.5 34987.5 35000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10000 37118 7.0%
 
12000 30069 5.6%
 
15000 28345 5.3%
 
20000 28085 5.3%
 
35000 21650 4.1%
 
8000 16656 3.1%
 
5000 16275 3.1%
 
6000 15624 2.9%
 
25000 14271 2.7%
 
16000 14165 2.7%
 
Other values (1360) 310170 58.3%
 

Minimum 5 values

ValueCountFrequency (%) 
500 7 < 0.1%
 
550 1 < 0.1%
 
600 3 < 0.1%
 
700 3 < 0.1%
 
725 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
35000 21650 4.1%
 
34975 20 < 0.1%
 
34950 12 < 0.1%
 
34925 5 < 0.1%
 
34900 7 < 0.1%
 

funded_amnt_inv
Highly correlated

This variable is highly correlated with funded_amnt and should be ignored for analysis.

Correlation: 0.9980407483
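
The rejection rule can be illustrated with a small sketch: compute the Pearson correlation between numeric columns and set aside any column whose correlation with an earlier, kept column exceeds a threshold. This is a simplified illustration, not the report generator's actual algorithm. The 0.9 threshold, the function names, and the toy rows are all assumptions made for the example; in particular, this sketch compares only against kept columns, whereas the report also rejects loan_amnt against funded_amnt_inv, which is itself rejected.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def reject_correlated(columns, threshold=0.9):
    """Scan columns in order; reject each one whose absolute correlation
    with an earlier kept column exceeds `threshold` (assumed value)."""
    kept, rejected = [], {}
    for name, values in columns.items():
        for other in kept:
            rho = pearson(values, columns[other])
            if abs(rho) > threshold:
                rejected[name] = (other, rho)
                break
        else:
            kept.append(name)
    return kept, rejected


# Made-up toy rows for illustration, NOT taken from the profiled dataset.
toy = {
    "funded_amnt": [5000, 12000, 8000, 20000, 35000, 15000],
    "funded_amnt_inv": [4975, 12000, 7950, 19900, 34975, 15000],
    "int_rate": [10.99, 9.17, 15.61, 7.89, 13.99, 12.69],
}

kept, rejected = reject_correlated(toy)
print("kept:", kept)
for name, (other, rho) in rejected.items():
    print(f"rejected: {name} (rho = {rho:.4f} with {other})")
```

Which column of a correlated pair is dropped depends on scan order, so the rejected variable is not necessarily the less informative one.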

grade
Categorical

Distinct count: 7
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: B (152713), C (147499), A (89107); other values (4): 143109
ValueCountFrequency (%) 
B 152713 28.7%
 
C 147499 27.7%
 
A 89107 16.7%
 
D 83567 15.7%
 
E 42495 8.0%
 
F 13826 2.6%
 
G 3221 0.6%
 
Max length: 1
Mean length: 1
Min length: 1
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

home_ownership
Categorical

Distinct count: 6
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: MORTGAGE (265940), RENT (213668), OWN (52664); other values (3): 156
ValueCountFrequency (%) 
MORTGAGE 265940 49.9%
 
RENT 213668 40.1%
 
OWN 52664 9.9%
 
OTHER 117 < 0.1%
 
NONE 36 < 0.1%
 
ANY 3 < 0.1%
 
Max length: 8
Mean length: 5.899242715
Min length: 3
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False
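
The categorical summaries list the most frequent values and fold the remainder into a single "Other values (k)" row. Since home_ownership has only six values, the full table is available and that aggregation can be reproduced directly. A minimal sketch; the helper name `top_k_with_other` and the choice k=3 are illustrative assumptions:

```python
# Value -> count, copied from the home_ownership frequency table.
counts = {
    "MORTGAGE": 265940,
    "RENT": 213668,
    "OWN": 52664,
    "OTHER": 117,
    "NONE": 36,
    "ANY": 3,
}


def top_k_with_other(counts, k=3):
    """Return the k most frequent values plus one aggregated
    'Other values (n)' row covering everything else."""
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    head, tail = ranked[:k], ranked[k:]
    rows = list(head)
    if tail:
        rows.append((f"Other values ({len(tail)})", sum(c for _, c in tail)))
    return rows


total = sum(counts.values())
rows = top_k_with_other(counts)
for label, count in rows:
    print(f"{label:<18}{count:>8}{100 * count / total:>7.1f}%")
```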

initial_list_status
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: f (274018), w (258410)
ValueCountFrequency (%) 
f 274018 51.5%
 
w 258410 48.5%
 
Max length: 1
Mean length: 1
Min length: 1
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

inq_last_6mths
Numeric

Distinct count: 24
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 16
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.6946030518
Minimum: 0
Maximum: 31
Zeros (%): 56.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 1
95-th percentile: 3
Maximum: 31
Range: 31
Interquartile range: 1

Descriptive statistics

Standard deviation: 0.9970254763
Coef of variation: 1.435388851
Kurtosis: 9.833191398
Mean: 0.6946030518
MAD: 0.7797904647
Skewness: 2.034336168
Sum: 369815
Variance: 0.9940598004
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%) 
0 298854 56.1%
 
1 144735 27.2%
 
2 56369 10.6%
 
3 22548 4.2%
 
4 6533 1.2%
 
5 2397 0.5%
 
6 720 0.1%
 
7 99 < 0.1%
 
8 79 < 0.1%
 
9 26 < 0.1%
 
Other values (13) 52 < 0.1%
 
(Missing) 16 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 298854 56.1%
 
1 144735 27.2%
 
2 56369 10.6%
 
3 22548 4.2%
 
4 6533 1.2%
 

Maximum 5 values

ValueCountFrequency (%) 
31 1 < 0.1%
 
28 1 < 0.1%
 
24 2 < 0.1%
 
20 1 < 0.1%
 
18 2 < 0.1%
 

int_rate
Numeric

Distinct count: 535
Unique (%): 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 13.24296878
Minimum: 5.32
Maximum: 28.99
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 5.32
5-th percentile: 6.62
Q1: 9.99
Median: 12.99
Q3: 16.2
95-th percentile: 20.99
Maximum: 28.99
Range: 23.67
Interquartile range: 6.21

Descriptive statistics

Standard deviation: 4.379611104
Coef of variation: 0.3307121823
Kurtosis: -0.1610721311
Mean: 13.24296878
MAD: 3.51084324
Skewness: 0.4284533005
Sum: 7050927.38
Variance: 19.18099342
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 5.32 5.37 5.605 5.86 5.96 ... 27.4 27.685 27.935 28.24 28.99 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10.99 20609 3.9%
 
9.17 15436 2.9%
 
15.61 15207 2.9%
 
9.99 13054 2.5%
 
7.89 12219 2.3%
 
13.99 11332 2.1%
 
12.69 11317 2.1%
 
12.29 11233 2.1%
 
12.99 11083 2.1%
 
17.57 10754 2.0%
 
Other values (525) 400184 75.2%
 

Minimum 5 values

ValueCountFrequency (%) 
5.32 5751 1.1%
 
5.42 335 0.1%
 
5.79 240 < 0.1%
 
5.93 1078 0.2%
 
5.99 193 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
28.99 67 < 0.1%
 
28.49 80 < 0.1%
 
27.99 2 < 0.1%
 
27.88 130 < 0.1%
 
27.49 4 < 0.1%
 

last_week_pay
Categorical

Distinct count: 98
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: 13th week (30333), 9th week (28626), 26th week (27475); other values (95): 445994
ValueCountFrequency (%) 
13th week 30333 5.7%
 
9th week 28626 5.4%
 
26th week 27475 5.2%
 
22th week 26000 4.9%
 
4th week 25704 4.8%
 
35th week 24037 4.5%
 
39th week 23796 4.5%
 
17th week 22036 4.1%
 
31th week 21437 4.0%
 
52th week 19391 3.6%
 
Other values (88) 283593 53.3%
 
Max length: 10
Mean length: 9.075619614
Min length: 8
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

loan_amnt
Highly correlated

This variable is highly correlated with funded_amnt_inv and should be ignored for analysis.

Correlation: 0.9971235007

loan_status
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: 0 (406601), 1 (125827)
ValueCountFrequency (%) 
0 406601 76.4%
 
1 125827 23.6%
 

member_id
Numeric

Distinct count: 532428
Unique (%): 100.0%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 35005472.35
Minimum: 70473
Maximum: 73544841
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 70473
5-th percentile: 1321077
Q1: 10866882.5
Median: 37095895
Q3: 58489200.75
95-th percentile: 70386197.45
Maximum: 73544841
Range: 73474368
Interquartile range: 47622318.25

Descriptive statistics

Standard deviation: 24121476.52
Coef of variation: 0.6890773042
Kurtosis: -1.467026931
Mean: 35005472.35
MAD: 21732127.66
Skewness: 0.01736816237
Sum: 1.863789363e+13
Variance: 5.818456293e+14
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[7.04730000e+04 1.17900000e+05 2.32335500e+05 3.18281500e+05 3.46476000e+05 ... 7.35059120e+07 7.35099185e+07 7.35158380e+07 7.35198200e+07 7.35448410e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
26216447 1 < 0.1%
 
5308085 1 < 0.1%
 
69615213 1 < 0.1%
 
32908910 1 < 0.1%
 
822758 1 < 0.1%
 
53935733 1 < 0.1%
 
57075318 1 < 0.1%
 
46962149 1 < 0.1%
 
71259567 1 < 0.1%
 
9256974 1 < 0.1%
 
Other values (532418) 532418 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
70473 1 < 0.1%
 
70681 1 < 0.1%
 
70694 1 < 0.1%
 
70735 1 < 0.1%
 
70978 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
73544841 1 < 0.1%
 
73542831 1 < 0.1%
 
73519894 1 < 0.1%
 
73519746 1 < 0.1%
 
73519699 1 < 0.1%
 

mths_since_last_delinq
Numeric

Distinct count: 148
Unique (%): < 0.1%
Missing (%): 51.2%
Missing (n): 272554
Infinite (%): 0.0%
Infinite (n): 0
Mean: 34.0557347
Minimum: 0
Maximum: 180
Zeros (%): 0.2%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 5
Q1: 15
Median: 31
Q3: 50
95-th percentile: 74
Maximum: 180
Range: 180
Interquartile range: 35

Descriptive statistics

Standard deviation: 21.88479738
Coef of variation: 0.6426170973
Kurtosis: -0.7640527973
Mean: 34.0557347
MAD: 18.48967387
Skewness: 0.4555285283
Sum: 8850200
Variance: 478.9443564
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
9 5163 1.0%
 
6 5125 1.0%
 
12 5037 0.9%
 
7 5032 0.9%
 
13 5017 0.9%
 
8 4988 0.9%
 
10 4933 0.9%
 
15 4818 0.9%
 
14 4812 0.9%
 
11 4730 0.9%
 
Other values (137) 210219 39.5%
 
(Missing) 272554 51.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 1025 0.2%
 
1 1967 0.4%
 
2 2405 0.5%
 
3 2900 0.5%
 
4 3572 0.7%
 

Maximum 5 values

ValueCountFrequency (%) 
180 1 < 0.1%
 
176 1 < 0.1%
 
171 1 < 0.1%
 
170 2 < 0.1%
 
159 1 < 0.1%
 

mths_since_last_major_derog
Numeric

Distinct count: 163
Unique (%): < 0.1%
Missing (%): 75.0%
Missing (n): 399448
Infinite (%): 0.0%
Infinite (n): 0
Mean: 44.12146187
Minimum: 0
Maximum: 180
Zeros (%): < 0.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 9
Q1: 27
Median: 44
Q3: 61
95-th percentile: 77
Maximum: 180
Range: 180
Interquartile range: 34

Descriptive statistics

Standard deviation: 22.19840987
Coef of variation: 0.5031204527
Kurtosis: -0.02750251233
Mean: 44.12146187
MAD: 18.44583628
Skewness: 0.26586312
Sum: 5867272
Variance: 492.7694008
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
45 2104 0.4%
 
46 2092 0.4%
 
42 2062 0.4%
 
43 2042 0.4%
 
37 2015 0.4%
 
48 1995 0.4%
 
38 1994 0.4%
 
44 1984 0.4%
 
40 1979 0.4%
 
47 1955 0.4%
 
Other values (152) 112758 21.2%
 
(Missing) 399448 75.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 84 < 0.1%
 
1 370 0.1%
 
2 368 0.1%
 
3 412 0.1%
 
4 609 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
180 1 < 0.1%
 
176 1 < 0.1%
 
171 1 < 0.1%
 
170 2 < 0.1%
 
165 1 < 0.1%
 

mths_since_last_record
Numeric

Distinct count: 123
Unique (%): < 0.1%
Missing (%): 84.6%
Missing (n): 450305
Infinite (%): 0.0%
Infinite (n): 0
Mean: 70.09306772
Minimum: 0
Maximum: 121
Zeros (%): 0.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 21
Q1: 51
Median: 70
Q3: 92
95-th percentile: 114
Maximum: 121
Range: 121
Interquartile range: 41

Descriptive statistics

Standard deviation: 28.13921926
Coef of variation: 0.4014550965
Kurtosis: -0.5866892462
Mean: 70.09306772
MAD: 23.0328709
Skewness: -0.2003172547
Sum: 5756253
Variance: 791.8156606
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
61 1180 0.2%
 
68 1166 0.2%
 
62 1157 0.2%
 
65 1156 0.2%
 
69 1156 0.2%
 
67 1156 0.2%
 
71 1151 0.2%
 
64 1133 0.2%
 
63 1123 0.2%
 
70 1115 0.2%
 
Other values (112) 70630 13.3%
 
(Missing) 450305 84.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 755 0.1%
 
1 45 < 0.1%
 
2 36 < 0.1%
 
3 71 < 0.1%
 
4 95 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
121 2 < 0.1%
 
120 8 < 0.1%
 
119 531 0.1%
 
118 876 0.2%
 
117 898 0.2%
 

open_acc
Numeric

Distinct count: 74
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 16
Infinite (%): 0.0%
Infinite (n): 0
Mean: 11.54559439
Minimum: 0
Maximum: 90
Zeros (%): < 0.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 5
Q1: 8
Median: 11
Q3: 14
95-th percentile: 21
Maximum: 90
Range: 90
Interquartile range: 6

Descriptive statistics

Standard deviation: 5.311442341
Coef of variation: 0.4600406148
Kurtosis: 3.170505953
Mean: 11.54559439
MAD: 4.063281959
Skewness: 1.251513089
Sum: 6147013
Variance: 28.21141974
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
9 48345 9.1%
 
10 47212 8.9%
 
8 46065 8.7%
 
11 43384 8.1%
 
7 40501 7.6%
 
12 38666 7.3%
 
13 33986 6.4%
 
6 33532 6.3%
 
14 28771 5.4%
 
5 23935 4.5%
 
Other values (63) 148015 27.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0 4 < 0.1%
 
1 137 < 0.1%
 
2 1654 0.3%
 
3 5741 1.1%
 
4 13991 2.6%
 

Maximum 5 values

ValueCountFrequency (%) 
90 1 < 0.1%
 
84 1 < 0.1%
 
82 1 < 0.1%
 
79 1 < 0.1%
 
76 1 < 0.1%
 

pub_rec
Numeric

Distinct count: 29
Unique (%): < 0.1%
Missing (%): < 0.1%
Missing (n): 16
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.194858493
Minimum: 0
Maximum: 86
Zeros (%): 84.7%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 1
Maximum: 86
Range: 86
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.583822182
Coef of variation: 2.996134132
Kurtosis: 1337.011555
Mean: 0.194858493
MAD: 0.3301539962
Skewness: 15.31822224
Sum: 103745
Variance: 0.3408483402
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%) 
0 451040 84.7%
 
1 67938 12.8%
 
2 8841 1.7%
 
3 2692 0.5%
 
4 974 0.2%
 
5 424 0.1%
 
6 238 < 0.1%
 
7 97 < 0.1%
 
8 67 < 0.1%
 
9 23 < 0.1%
 
Other values (18) 78 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 451040 84.7%
 
1 67938 12.8%
 
2 8841 1.7%
 
3 2692 0.5%
 
4 974 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
86 1 < 0.1%
 
63 1 < 0.1%
 
49 1 < 0.1%
 
40 1 < 0.1%
 
28 1 < 0.1%
 

purpose
Categorical

Distinct count: 14
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: debt_consolidation (314989), credit_card (123670), home_improvement (31087); other values (11): 62682
ValueCountFrequency (%) 
debt_consolidation 314989 59.2%
 
credit_card 123670 23.2%
 
home_improvement 31087 5.8%
 
other 25652 4.8%
 
major_purchase 10284 1.9%
 
small_business 6146 1.2%
 
car 5266 1.0%
 
medical 5117 1.0%
 
moving 3243 0.6%
 
vacation 2812 0.5%
 
Other values (4) 4162 0.8%
 
Max length: 18
Mean length: 15.04095953
Min length: 3
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

pymnt_plan
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: n (532420), y (8)
ValueCountFrequency (%) 
n 532420 > 99.9%
 
y 8 < 0.1%
 

recoveries
Numeric

Distinct count: 14024
Unique (%): 2.6%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 45.7178317
Minimum: 0
Maximum: 33520.27
Zeros (%): 97.2%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 33520.27
Range: 33520.27
Interquartile range: 0

Descriptive statistics

Standard deviation: 409.6474671
Coef of variation: 8.960343302
Kurtosis: 624.6296143
Mean: 45.7178317
MAD: 89.02160815
Skewness: 18.07600407
Sum: 24341453.7
Variance: 167811.0473
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 1.250000e-01 8.270000e+00 9.665000e+00 1.442500e+01 ... 7.060275e+03 9.129405e+03 1.296968e+04 2.306385e+04 3.352027e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 517723 97.2%
 
200 7 < 0.1%
 
11.7 6 < 0.1%
 
10.08 5 < 0.1%
 
3000 5 < 0.1%
 
13.2 5 < 0.1%
 
10.2 5 < 0.1%
 
1200 5 < 0.1%
 
13.5 5 < 0.1%
 
800 5 < 0.1%
 
Other values (14014) 14657 2.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0 517723 97.2%
 
0.25 1 < 0.1%
 
2.34 1 < 0.1%
 
3.6 1 < 0.1%
 
6.3 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
33520.27 1 < 0.1%
 
31900.52 1 < 0.1%
 
29623.35 1 < 0.1%
 
26308.47 1 < 0.1%
 
23184.33 1 < 0.1%
 

revol_bal
Numeric

Distinct count: 63459
Unique (%): 11.9%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 16921.28032
Minimum: 0
Maximum: 2568995
Zeros (%): 0.4%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 2023
Q1: 6444
Median: 11876
Q3: 20843
95-th percentile: 43927
Maximum: 2568995
Range: 2568995
Interquartile range: 14399

Descriptive statistics

Standard deviation: 22423.21584
Coef of variation: 1.32514889
Kurtosis: 946.9518118
Mean: 16921.28032
MAD: 11410.54285
Skewness: 15.98021197
Sum: 9009363440
Variance: 502800608.4
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 7.5000000e+00 2.6500000e+01 1.0350000e+02 ... 4.1199050e+05 5.1045350e+05 6.9280100e+05 1.0418815e+06 2.5689950e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2017 0.4%
 
5235 52 < 0.1%
 
5399 46 < 0.1%
 
5466 45 < 0.1%
 
5853 45 < 0.1%
 
4136 44 < 0.1%
 
7792 44 < 0.1%
 
6189 44 < 0.1%
 
5886 43 < 0.1%
 
6052 43 < 0.1%
 
Other values (63449) 530005 99.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2017 0.4%
 
1 33 < 0.1%
 
2 23 < 0.1%
 
3 32 < 0.1%
 
4 23 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
2568995 1 < 0.1%
 
2560703 1 < 0.1%
 
1746716 1 < 0.1%
 
1743266 1 < 0.1%
 
1190046 1 < 0.1%
 

revol_util
Numeric

Distinct count: 1266
Unique (%): 0.2%
Missing (%): 0.1%
Missing (n): 287
Infinite (%): 0.0%
Infinite (n): 0
Mean: 55.05718917
Minimum: 0
Maximum: 892.3
Zeros (%): 0.4%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 13.8
Q1: 37.7
Median: 56
Q3: 73.6
95-th percentile: 92.5
Maximum: 892.3
Range: 892.3
Interquartile range: 35.9

Descriptive statistics

Standard deviation: 23.8534365
Coef of variation: 0.4332483525
Kurtosis: 2.178742477
Mean: 55.05718917
MAD: 19.78202802
Skewness: -0.08961858431
Sum: 29298187.7
Variance: 568.9864328
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0 2137 0.4%
 
53 1124 0.2%
 
58 1082 0.2%
 
52 1054 0.2%
 
62 1047 0.2%
 
48 1045 0.2%
 
55 1040 0.2%
 
54 1023 0.2%
 
61 1021 0.2%
 
59 1018 0.2%
 
Other values (1255) 520550 97.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2137 0.4%
 
0.03 1 < 0.1%
 
0.05 1 < 0.1%
 
0.1 237 < 0.1%
 
0.12 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
892.3 1 < 0.1%
 
366.6 1 < 0.1%
 
193 1 < 0.1%
 
184.6 1 < 0.1%
 
166.9 1 < 0.1%
 

sub_grade
Categorical

Distinct count: 35
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: B3 (33844), B4 (33198), C1 (31975); other values (32): 433411
ValueCountFrequency (%) 
B3 33844 6.4%
 
B4 33198 6.2%
 
C1 31975 6.0%
 
C2 31356 5.9%
 
C3 30080 5.6%
 
B2 29390 5.5%
 
B5 29313 5.5%
 
C4 29103 5.5%
 
A5 27016 5.1%
 
B1 26968 5.1%
 
Other values (25) 230185 43.2%
 
Max length: 2
Mean length: 2
Min length: 2
Contains chars: True
Contains digits: True
Contains spaces: False
Contains non-words: False

term
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Most frequent: 36 months (372793), 60 months (159635)
ValueCountFrequency (%) 
36 months 372793 70.0%
 
60 months 159635 30.0%
 
Max length: 9
Mean length: 9
Min length: 9
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

title
Categorical

Distinct count: 39694
Unique (%): 7.5%
Missing (%): < 0.1%
Missing (n): 90
Most frequent: Debt consolidation (248967), Credit card refinancing (98582), Home improvement (24057); other values (39690): 160732
ValueCountFrequency (%) 
Debt consolidation 248967 46.8%
 
Credit card refinancing 98582 18.5%
 
Home improvement 24057 4.5%
 
Other 19053 3.6%
 
Debt Consolidation 9932 1.9%
 
Major purchase 7195 1.4%
 
Medical expenses 3997 0.8%
 
Business 3962 0.7%
 
Consolidation 3354 0.6%
 
Car financing 3333 0.6%
 
Other values (39683) 109906 20.6%
 
Max length: 80
Mean length: 17.72961602
Min length: 2
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

tot_coll_amt
Numeric

Distinct count: 8082
Unique (%): 1.5%
Missing (%): 7.9%
Missing (n): 42004
Infinite (%): 0.0%
Infinite (n): 0
Mean: 213.5622217
Minimum: 0
Maximum: 496651
Zeros (%): 79.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 634
Maximum: 496651
Range: 496651
Interquartile range: 0

Descriptive statistics

Standard deviation: 1958.571538
Coef of variation: 9.170964429
Kurtosis: 10577.26556
Mean: 213.5622217
MAD: 377.3924124
Skewness: 61.64220691
Sum: 104736039
Variance: 3836002.471
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0 420903 79.1%
 
50 1053 0.2%
 
100 815 0.2%
 
75 631 0.1%
 
150 418 0.1%
 
200 409 0.1%
 
60 392 0.1%
 
80 374 0.1%
 
70 349 0.1%
 
55 327 0.1%
 
Other values (8071) 64753 12.2%
 
(Missing) 42004 7.9%
 

Minimum 5 values

ValueCountFrequency (%) 
0 420903 79.1%
 
2 3 < 0.1%
 
4 1 < 0.1%
 
7 2 < 0.1%
 
9 3 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
496651 1 < 0.1%
 
296368 1 < 0.1%
 
227157 1 < 0.1%
 
169257 1 < 0.1%
 
143558 1 < 0.1%
 

tot_cur_bal
Numeric

Distinct count: 251641
Unique (%): 47.3%
Missing (%): 7.9%
Missing (n): 42004
Infinite (%): 0.0%
Infinite (n): 0
Mean: 139554.1108
Minimum: 0
Maximum: 8000078
Zeros (%): < 0.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 9217
Q1: 29839.75
Median: 80669.5
Q3: 208479.25
95-th percentile: 424251.95
Maximum: 8000078
Range: 8000078
Interquartile range: 178639.5

Descriptive statistics

Standard deviation: 153914.8774
Coef of variation: 1.102904648
Kurtosis: 34.66750078
Mean: 139554.1108
MAD: 113658.531
Skewness: 3.011754729
Sum: 6.844068523e+10
Variance: 2.36897895e+10
Memory size: 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)
0 78 < 0.1%
14511 17 < 0.1%
5537 15 < 0.1%
27316 15 < 0.1%
19998 15 < 0.1%
29232 14 < 0.1%
22611 14 < 0.1%
23346 14 < 0.1%
23254 14 < 0.1%
14535 14 < 0.1%
Other values (251630) 490214 92.1%
(Missing) 42004 7.9%

Minimum 5 values

Value Count Frequency (%)
0 78 < 0.1%
1 1 < 0.1%
2 1 < 0.1%
4 3 < 0.1%
5 2 < 0.1%

Maximum 5 values

Value Count Frequency (%)
8000078 1 < 0.1%
4772549 1 < 0.1%
4026405 1 < 0.1%
3881449 1 < 0.1%
3840795 1 < 0.1%

total_acc
Numeric

Distinct count 127
Unique (%) < 0.1%
Missing (%) < 0.1%
Missing (n) 16
Infinite (%) 0.0%
Infinite (n) 0
Mean 25.26735686
Minimum 1
Maximum 162
Zeros (%) 0.0%
Mini histogram

Quantile statistics

Minimum 1
5-th percentile 9
Q1 17
Median 24
Q3 32
95-th percentile 47
Maximum 162
Range 161
Interquartile range 15

Descriptive statistics

Standard deviation 11.84321082
Coef of variation 0.4687158568
Kurtosis 1.35541323
Mean 25.26735686
MAD 9.298095236
Skewness 0.8931946471
Sum 13452644
Variance 140.2616425
Memory size 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)
20 19338 3.6%
22 19266 3.6%
21 19099 3.6%
19 19056 3.6%
18 18738 3.5%
23 18703 3.5%
24 18623 3.5%
17 18536 3.5%
25 17846 3.4%
16 17448 3.3%
Other values (116) 345759 64.9%

Minimum 5 values

Value Count Frequency (%)
1 13 < 0.1%
2 35 < 0.1%
3 283 0.1%
4 1661 0.3%
5 2831 0.5%

Maximum 5 values

Value Count Frequency (%)
162 1 < 0.1%
156 1 < 0.1%
140 1 < 0.1%
138 1 < 0.1%
135 1 < 0.1%

total_rec_int
Numeric

Distinct count 243149
Unique (%) 45.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1753.428788
Minimum 0
Maximum 24205.62
Zeros (%) 2.1%
Mini histogram

Quantile statistics

Minimum 0
5-th percentile 80.19
Q1 441.6
Median 1072.69
Q3 2234.735
95-th percentile 5889.503
Maximum 24205.62
Range 24205.62
Interquartile range 1793.135

Descriptive statistics

Standard deviation 2093.199837
Coef of variation 1.19377522
Kurtosis 11.64275416
Mean 1753.428788
MAD 1404.923078
Skewness 2.833204854
Sum 933574582.7
Variance 4381485.558
Memory size 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-03 4.8200000e+00 8.2400000e+00 1.6920000e+01 ... 1.5898660e+04 1.7616215e+04 1.9342250e+04 2.2021040e+04 2.4205620e+04], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 10953 2.1%
82.76 86 < 0.1%
174.9 78 < 0.1%
283.29 77 < 0.1%
319.41 76 < 0.1%
41.37 75 < 0.1%
71.33 75 < 0.1%
451.32 70 < 0.1%
130.12 67 < 0.1%
475.52 64 < 0.1%
Other values (243139) 520807 97.8%

Minimum 5 values

Value Count Frequency (%)
0 10953 2.1%
0.01 7 < 0.1%
0.56 1 < 0.1%
0.59 1 < 0.1%
0.62 2 < 0.1%

Maximum 5 values

Value Count Frequency (%)
24205.62 1 < 0.1%
23450.38 1 < 0.1%
23172.31 1 < 0.1%
23171.65 1 < 0.1%
22886.89 1 < 0.1%

total_rec_late_fee
Numeric

Distinct count 4073
Unique (%) 0.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.3949535031
Minimum 0
Maximum 358.68
Zeros (%) 98.6%
Mini histogram

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 358.68
Range 358.68
Interquartile range 0

Descriptive statistics

Standard deviation 4.0915461
Coef of variation 10.35956402
Kurtosis 545.4515301
Mean 0.3949535031
MAD 0.7788762497
Skewness 17.68583522
Sum 210284.3037
Variance 16.74074949
Memory size 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-03 3.95000000e-01 1.45150316e+01 1.48685617e+01 ... 7.49343786e+01 7.50400000e+01 1.05705000e+02 1.47930000e+02 3.58680000e+02], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 524986 98.6%
15 1680 0.3%
30 219 < 0.1%
45 31 < 0.1%
18.87 10 < 0.1%
16.37 10 < 0.1%
18.02 9 < 0.1%
20.6 9 < 0.1%
16.84 9 < 0.1%
20.5 9 < 0.1%
Other values (4063) 5456 1.0%

Minimum 5 values

Value Count Frequency (%)
0 524986 98.6%
0.01 3 < 0.1%
0.1017045619 1 < 0.1%
0.1357656267 1 < 0.1%
0.1800829035 1 < 0.1%

Maximum 5 values

Value Count Frequency (%)
358.68 1 < 0.1%
294.68 1 < 0.1%
252.8 1 < 0.1%
229.75 1 < 0.1%
213.3 1 < 0.1%

total_rev_hi_lim
Numeric

Distinct count 14698
Unique (%) 2.8%
Missing (%) 7.9%
Missing (n) 42004
Infinite (%) 0.0%
Infinite (n) 0
Mean 32080.57292
Minimum 0
Maximum 9999999
Zeros (%) < 0.1%
Mini histogram

Quantile statistics

Minimum 0
5-th percentile 6000
Q1 14000
Median 23700
Q3 39800
95-th percentile 83000
Maximum 9999999
Range 9999999
Interquartile range 25800

Descriptive statistics

Standard deviation 38053.03531
Coef of variation 1.18617069
Kurtosis 19273.88276
Mean 32080.57292
MAD 19614.31186
Skewness 77.3834345
Sum 1.573308289e+10
Variance 1448033496
Memory size 4.1 MiB
Histogram
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)
13500 1585 0.3%
14500 1576 0.3%
10000 1569 0.3%
12000 1564 0.3%
15000 1557 0.3%
14000 1536 0.3%
15500 1533 0.3%
13000 1532 0.3%
16500 1528 0.3%
12500 1525 0.3%
Other values (14687) 474919 89.2%
(Missing) 42004 7.9%

Minimum 5 values

Value Count Frequency (%)
0 161 < 0.1%
100 4 < 0.1%
200 5 < 0.1%
300 46 < 0.1%
400 23 < 0.1%

Maximum 5 values

Value Count Frequency (%)
9999999 2 < 0.1%
2013133 1 < 0.1%
1998700 1 < 0.1%
1314900 1 < 0.1%
1200500 1 < 0.1%

verification_status
Categorical

Distinct count 3
Unique (%) < 0.1%
Missing (%) 0.0%
Missing (n) 0
Most frequent values (bar chart): Source Verified 197750; Verified 174702; Not Verified 159976
Value Count Frequency (%)
Source Verified 197750 37.1%
Verified 174702 32.8%
Not Verified 159976 30.0%
Max length 15
Mean length 11.80174221
Min length 8
Contains chars True
Contains digits False
Contains spaces True
Contains non-words True
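
The frequency percentages and the mean label length for verification_status can both be recovered from the three counts and the 532428 observations reported in the overview. The following is a small sketch of that arithmetic, not the report's own code; the variable names are illustrative.

```python
# Counts copied from the verification_status frequency table.
n_obs = 532428
counts = {
    "Source Verified": 197750,
    "Verified": 174702,
    "Not Verified": 159976,
}

# Frequency (%) is the count divided by the number of observations.
percentages = {label: round(100 * c / n_obs, 1) for label, c in counts.items()}

# Mean length is the count-weighted average of the label lengths.
mean_length = sum(len(label) * c for label, c in counts.items()) / n_obs

print(percentages)
print(mean_length)
```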

verification_status_joint
Categorical

Distinct count 4
Unique (%) < 0.1%
Missing (%) 99.9%
Missing (n) 532123
Most frequent values (bar chart): Not Verified 170; Verified 102; Source Verified 33; (Missing) 532123
Value Count Frequency (%)
Not Verified 170 < 0.1%
Verified 102 < 0.1%
Source Verified 33 < 0.1%
(Missing) 532123 99.9%
Max length 15
Mean length 3.004575267
Min length 3
Contains chars True
Contains digits False
Contains spaces True
Contains non-words True
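
A mean length of 3.004575267 and a minimum length of 3 look odd for labels that are at least 8 characters long. One explanation that reproduces the figures is that missing entries were converted to the three-character string "nan" before lengths were measured. This is an assumption about how the report was generated, not something the report states; the sketch below only checks that the arithmetic fits.

```python
# Counts copied from the verification_status_joint frequency table.
n_obs = 532428
n_missing = 532123
counts = {"Not Verified": 170, "Verified": 102, "Source Verified": 33}

# Assumption: each missing value was measured as the text "nan".
missing_as_text = "nan"

total_chars = sum(len(label) * c for label, c in counts.items())
total_chars += len(missing_as_text) * n_missing

mean_length = total_chars / n_obs
min_length = min([len(missing_as_text)] + [len(label) for label in counts])

print(mean_length, min_length)
```

Under the same assumption, the distinct count of 4 would be the three labels plus the missing marker.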

zip_code
Categorical

Distinct count 917
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Most frequent values (bar chart): 945xx 5845; 750xx 5680; 112xx 5632; Other values (914) 515271
Value Count Frequency (%)
945xx 5845 1.1%
750xx 5680 1.1%
112xx 5632 1.1%
606xx 5176 1.0%
300xx 4757 0.9%
100xx 4563 0.9%
900xx 4500 0.8%
070xx 4486 0.8%
331xx 4475 0.8%
770xx 4177 0.8%
Other values (907) 483137 90.7%
Max length 5
Mean length 5
Min length 5
Contains chars True
Contains digits True
Contains spaces False
Contains non-words False
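
The "Other values" row in the zip_code table is a remainder: the observations and distinct values left over once the ten listed prefixes are set aside. A short sketch of that bookkeeping, using the counts from the table and illustrative variable names:

```python
# Counts copied from the zip_code frequency table.
n_obs = 532428
n_distinct = 917
top10 = {
    "945xx": 5845, "750xx": 5680, "112xx": 5632, "606xx": 5176, "300xx": 4757,
    "100xx": 4563, "900xx": 4500, "070xx": 4486, "331xx": 4475, "770xx": 4177,
}

# Everything not covered by the listed values falls into "Other values".
other_count = n_obs - sum(top10.values())
other_distinct = n_distinct - len(top10)
other_share = round(100 * other_count / n_obs, 1)

print(other_distinct, other_count, other_share)
```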

Correlations

Missing values

Sample

First rows

acc_now_delinq addr_state annual_inc application_type batch_enrolled collection_recovery_fee collections_12_mths_ex_med delinq_2yrs desc dti emp_length emp_title funded_amnt funded_amnt_inv grade home_ownership initial_list_status inq_last_6mths int_rate last_week_pay loan_amnt loan_status member_id mths_since_last_delinq mths_since_last_major_derog mths_since_last_record open_acc pub_rec purpose pymnt_plan recoveries revol_bal revol_util sub_grade term title tot_coll_amt tot_cur_bal total_acc total_rec_int total_rec_late_fee total_rev_hi_lim verification_status verification_status_joint zip_code
00.0FL28700.0INDIVIDUAL0.00.00.0NaN33.889 yearsclerk1435014350.0EOWNf1.019.1926th week1435005818933650.074.075.014.01.0debt_consolidationn0.022515.073.1E336 monthsDebt consolidation0.028699.028.01173.840.030800.0Source VerifiedNaN349xx
10.0MD65000.0INDIVIDUALBAT15865990.00.00.0NaN3.64< 1 yearHuman Resources Specialist48004800.0BMORTGAGEw1.010.999th week4800070011223NaNNaNNaN6.00.0home_improvementn0.07624.023.2B436 monthsHome improvement0.09974.013.083.950.032900.0Source VerifiedNaN209xx
20.0OH45000.0INDIVIDUALBAT15865990.00.00.0NaN18.422 yearsDriver1000010000.0AOWNw0.07.269th week10000070255675NaNNaNNaN5.00.0debt_consolidationn0.010877.031.2A436 monthsDebt consolidation65.038295.019.056.470.034900.0Not VerifiedNaN447xx
30.0VA105000.0INDIVIDUALBAT48080220.00.00.0> My goal is to obtain a loan to pay off my high credit cards and get out of debt within 3 years.<br>14.9710+ yearsUs office of Personnel Management1500015000.0DRENTf2.019.72135th week150000189393646.0NaNNaN10.00.0debt_consolidationn0.013712.055.5D536 monthsDebt consolidation0.055564.021.04858.620.024700.0Not VerifiedNaN221xx
40.0CA52000.0INDIVIDUALBAT28336420.00.00.0NaN20.1610+ yearsLAUSD-HOLLYWOOD HIGH SCHOOL1600016000.0BRENTw0.010.6496th week1600007652106NaNNaNNaN11.00.0credit_cardn0.035835.076.2B236 monthsrefi0.047159.027.02296.410.047033.0VerifiedNaN900xx
50.0IN120000.0INDIVIDUALBAT25755490.00.00.0> We are requesting this loan to help re-organize our finances after having a tumultuous year that resulted in unexpected medical bills. I have been in the same line of work for over five years and have been at my current company, which is very stable in the marketplace, for two and a half years.<br>12.302 yearsDesign Consultant1500014950.0AMORTGAGEf0.08.90113th week1500001024726856.0NaNNaN18.00.0debt_consolidationn0.019040.064.5A536 monthsCredit Card Debt Consolidation0.0350619.030.01957.240.029500.0Not VerifiedNaN461xx
60.0CA75000.0INDIVIDUAL0.00.00.0> Funds will be used to pay off a debt. I am also a lender as well here on LC.<br>5.705 yearsTOYOTA OF NORTH HOLLYWOOD50004975.0ARENTf0.07.90117th week500018089625NaNNaN105.013.02.0debt_consolidationn0.013272.023.9A436 monthsPAY THEM OFF1023.013272.023.0578.360.055500.0Source VerifiedNaN913xx
70.0AL54000.0INDIVIDUALNaN0.00.00.0NaN11.638 yearsBanker60006000.0BMORTGAGEf1.09.1778th week600002304311646.054.0NaN13.00.0credit_cardn0.03484.029.5B136 monthsCredit card refinancing0.0272579.049.0637.510.011800.0Not VerifiedNaN351xx
80.0CA92000.0INDIVIDUALBAT41361520.00.00.0NaN30.857 yearsLVN60006000.0CMORTGAGEw0.013.9944th week600004590093377.0NaNNaN16.00.0home_improvementn0.047567.076.6C436 monthsHome improvement0.0281521.027.0621.720.062100.0Not VerifiedNaN917xx
90.0KY72000.0INDIVIDUALBAT46945720.00.00.0NaN33.922 yearsRegistered Nurse3455034550.0DMORTGAGEw0.017.1452th week34550041272507NaNNaNNaN12.00.0debt_consolidationn0.030040.090.5D460 monthsDebt consolidation0.076034.030.05535.460.033200.0VerifiedNaN427xx

Last rows

acc_now_delinq addr_state annual_inc application_type batch_enrolled collection_recovery_fee collections_12_mths_ex_med delinq_2yrs desc dti emp_length emp_title funded_amnt funded_amnt_inv grade home_ownership initial_list_status inq_last_6mths int_rate last_week_pay loan_amnt loan_status member_id mths_since_last_delinq mths_since_last_major_derog mths_since_last_record open_acc pub_rec purpose pymnt_plan recoveries revol_bal revol_util sub_grade term title tot_coll_amt tot_cur_bal total_acc total_rec_int total_rec_late_fee total_rev_hi_lim verification_status verification_status_joint zip_code
5324180.0UT54000.0INDIVIDUALNaN0.00.00.0NaN25.079 yearsTeacher1000010000.0BRENTw0.09.764th week10000071054913NaNNaNNaN11.00.0credit_cardn0.013856.070.3B336 monthsCredit card refinancing890.0121294.035.075.910.019700.0Source VerifiedNaN840xx
5324190.0RI45000.0INDIVIDUALBAT11846940.00.01.0NaN7.97< 1 yearHelp Desk Technician48004800.0BRENTw0.011.5331th week48000517744686.0NaNNaN14.00.0credit_cardn0.04557.043.8B536 monthsCredit card refinancing0.024985.023.0308.180.010400.0Not VerifiedNaN028xx
5324200.0PA37000.0INDIVIDUALBAT54896740.00.00.0NaN25.369 yearsAdmin Secretary57005700.0COWNf3.014.164th week570011666149149.0NaNNaN10.00.0debt_consolidationn0.09944.063.7C236 monthsDebt consolidation0.089997.035.067.260.015600.0Source VerifiedNaN178xx
5324210.0VA55000.0INDIVIDUALBAT20789740.00.01.0NaN4.653 yearsTeacher1300013000.0CMORTGAGEf0.012.9965th week130000320975387.07.0NaN11.00.0debt_consolidationn0.06638.024.7C160 monthsDebt consolidation0.011231.025.01916.770.026900.0Source VerifiedNaN239xx
5324220.0WA49900.0INDIVIDUALNaN0.00.00.0NaN20.2010+ yearsChampion Mortgage1000010000.0CMORTGAGEw1.014.3322th week1000018219842NaNNaNNaN13.00.0debt_consolidationn0.014510.060.0C236 monthsConsolidation0.0189233.017.0569.120.024200.0Source VerifiedNaN984xx
5324230.0MI75000.0INDIVIDUAL0.00.00.0NaN14.5310+ yearsRegistered Nurse2000020000.0BMORTGAGEf0.012.4965th week20000031296187NaNNaN51.012.01.0debt_consolidationn0.015775.063.6B536 monthsDebt consolidation0.083087.034.02595.450.024800.0Source VerifiedNaN481xx
5324240.0MI59000.0INDIVIDUALBAT20038480.00.00.0NaN22.9710+ yearsAccount Mgr1200012000.0CMORTGAGEw0.014.9970th week12000029403184NaNNaN81.010.01.0debt_consolidationn0.09453.053.1C560 monthsDebt consolidation0.0227812.029.02182.920.017800.0Not VerifiedNaN496xx
5324250.0TN42504.0INDIVIDUALNaN0.00.00.0NaN27.278 yearsComcast cable1872518725.0ERENTf1.020.809th week187251735760726.0NaNNaN14.00.0debt_consolidationn0.012085.049.9E160 monthsDebt consolidation0.026010.026.0645.320.024200.0VerifiedNaN370xx
5324260.0OH50000.0INDIVIDUALBAT31936890.00.00.0NaN14.911 yearResident Physician2100021000.0DRENTw1.016.2978th week21000023182668NaNNaNNaN7.00.0credit_cardn0.020902.089.7D260 monthsCredit card refinancing0.029197.014.04619.790.023300.0Source VerifiedNaN432xx
5324270.0CA53000.0INDIVIDUALBAT41361520.00.00.0NaN17.80< 1 yearHealth Care Analyst1000010000.0ARENTf0.06.3944th week10000046122259NaNNaNNaN11.00.0debt_consolidationn0.010058.046.4A236 monthsDebt consolidation0.047866.020.0467.520.021700.0Not VerifiedNaN956xx